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PREFACE 


This  report  represents  the  first  chapter  of  a  book  in  preparation 
on  Inference  and  data  analysis  in  reliability  and  life  testing.  The 
point  of  view  adopted  differs  from  that  of  most  books  on  the  subject 
In  the  following  basic  respect:  Prior  information  available  to  the 
reliability  aruxlyat  is  utilized  fully  in  a  formal  atatiatioal  fashion. 
Experience  accumulated  In  helping  engineers,  quality  assurance  managers, 
scientists,  blostatlstlclans,  and  others  who  must  make  estimates  and 
reach  decisions  from  either  planned  experiments  or  retrospective  data 
has  shown  us  that  the  point  of  view  adopted  throughout  the  book  has 
resulted  In  useful  solutions  to  real-life  problems.  By  contrast,  more 
classical  statistical  methods  have  often  proven  Inadequate  In  many  prac¬ 
tical  problems  simply  because  the  data  available  are  Insufficient  to  reach 
conclusions  with  a  desired  degree  of  assurance. 

The  book  Is  Intended  primarily  for  actual  use  by  the  engineering 
and  scientific  practitioner,  rather  than  for  theoretical  study  and 
philosophical  analysis  by  the  statistician.  Thus  we  omit  a  philosophical 
justification  of  the  methods  presented;  rather,  we  rely  on  the  fact  that 
they  have  led  to  useful  answers  to  problems  that  have  arisen  In  practice. 

One  final  point:  Many  of  the  methods  and  results  are  original  and 
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ABSTRACT 


This  paper  constitutes  Chapter  1  of  a  book  on  "Inference  and 
Data  Analysis  for  Reliability  and  Life  Testing.”  The  purpose 
Is  to  provide  an  elementary  coherent  Bayesian  foundation  for 
research  In  statistical  reliability  theory.  The  results  In 
Section  3  and  the  Appendix  concerning  the  role  of  failures 
and  total  time  on  test  In  statistical  Inference  for  the  expo¬ 
nential  model  Is  new. 
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INFERENCE  FOR  THE  EXPONENTIAL  LIFE  DISTRIBUTION 


by 

Richard  E.  Barlow  and  Frank  Pros chan 

Our  objective  Is  to  develop  methodology  for  analyzing  life  test 
data.  Initially,  we  have  only  data — no  mathematical  models.  Through 
an  exploratory  data  analysis  (see  Chapter  3)  or  an  analysis  based  on 
the  physical  processes  generating  the  data,  we  may  select  an  exponen¬ 
tial  life  distribution  model  as  appropriate  for  the  analysis  of  the 
data.  Of  course,  this  may  be  at  best  only  an  approximation  to  the 
"true”  underlying  life  distribution.  With  this  reservation  in  mind, 
we  suppose  the  life  distribution  F  is  exponential;  specifically: 

(0.1)  F(x  I  X)  •  1  -  e’^^  ,  X  >  0  ,  X  >  0  , 

where  X  is  the  unknown  constant  failure  rate.  The  vertical  bar 
in  F(x  I  X)  Indicates  that  we  are  conditioning  on  the  parameter  X 
l.e.,  for  specified  X  the  distribution  is  exponential  with  failure 
rate  X  .  The  corresponding  density  Is: 

(0.2)  f(x  I  X)  -  Xe  ,  X  ^  0  ,  X  >  0  . 
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1.  BASIC  CONCEPTS 

To  begin  with,  we  suppose  the  life  test  data  consist  of  observed 
complete  lifetimes  x^,X2i  . . . ,  x^  on  n  units.  For  example. 

Table  1.1  lists  ordered  llfelengths  of  100  Kevlar  49/Epox'/  strands 
subjected  to  a  high  static  load  (Barlow,  Toland  and  Freenan  (1979)). 

An  exploratory  data  analysis  Indicates  that  an  exponential  life  dis¬ 
tribution  may  be  appropriate  for  analyzing  these  llfelengths.  Thus,  we 
assume  that  the  observations  constitute  a  sample  of  n  Independent, 
Identically  distributed  random  variables  with  distribution  F  given 
by  (0.1).  Although  we  assume  a  fixed  X  exists  which  specifies  F  , 
we  are  uncertain  as  to  the  true  value  of  X  ,  and  seek  a  method  which 
uses  the  data  to  express  probabilistically  our  uncertainty  regarding 
the  true  value  of  X  . 

The  first  step  Is  to  evaluate  the  joint  probability  density  of 

the  random  lifetimes  X, ,  . . . ,  X  evaluated  at  the  observed  values 

X  n 

x^ ,  . . . ,  x^  .  since  we  are  assuming  that  X^ ,  . . . ,  X^  are  Indepen¬ 
dent  given  X  ,  the  joint  density  of  the  observed  values  is: 

n  f(x  1  X)  ■  x”  exp  (-X  I  X  )  . 

1-1  ^  \  1-1  V 


(1.1) 
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TABLE  1.1 

TIMES  TO  FAILURE  OF  STRANDS  SUBJECTED 
TO  STRESS  AT  80%  OF  MEAN  RUPTURE  STRENGTH 


Rank 

Hours 

Rank 

Hours 

Rank 

Hours 

Rank 

Hours 

1 

1.8 

26 

84.2 

51 

152.2 

76 

285.9 

2 

3.1 

27 

87.1 

52 

152.8 

77 

292.6 

3 

4.2 

28 

87.3 

53 

157.7 

78 

295.1 

4 

6.0 

29 

93.2 

54 

160.0 

79 

301.1 

5 

7.5 

30 

103.4 

55 

163.6 

80 

304.3 

6 

8.2 

31 

104.6 

56 

166.9 

81 

316.8 

7 

8.5 

32 

105.5 

57 

170.5 

82 

329.8 

8 

10.3 

33 

108.8 

58 

174.9 

83 

334.1 

9 

10.6 

34 

112.6 

59 

177.7 

84 

346.2 

10 

24,2 

35 

116.8 

60 

179.2 

85 

351.2 

11 

29.6 

36 

118.0 

61 

183.6 

86 

353.3 

12 

31.7 

37 

122.3 

62 

183.8 

87 

369.3 

13 

41.9 

38 

123.5 

63 

194.3 

88 

372.3 

14 

44.1 

39 

124.4 

64 

195.1 

89 

381.3 

15 

49.5 

40 

125.4 

65 

195.3 

90 

393.5 

16 

50.1 

41 

129.5 

66 

202.6 

91 

451.3 

17 

59.7 

42 

130.4 

67 

220.2 

92 

461.5 

18 

61.7 

43 

131.6 

68 

221.3 

93 

574.2 

19 

64.4 

44 

132.8 

69 

227.2 

94 

653.3 

20 

69,7 

45 

133.8 

70 

251.0 

95 

663.0 

21 

70.0 

46 

137.0 

71 

266.5 

96 

669.8 

22 

77.8 

47 

140.2 

72 

267.9 

97 

739.7 

23 

80.5 

48 

140.9 

73 

269.2 

98 

759.6 

24 

82.3 

49 

148.5 

74 

270.4 

99 

894.7 

25 

83.5 

50 

149,2 

75 

272.5 

100 

974.9 
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The  Likelihood  Function 

To  focus  attention  on  the  parameter  of  Interest  X  ,  we  regard 

(1.1)  as  a  function  of  X  and  call 

(1.2)  L(X  I  Xj^,  ...,  x^)  *  X*^  exp  |-X  x^^ 

the  likelihood  funation.  (The  likelihood,  although  a  function  of  the 
parameter  X  ,  Is  not  a  probability  density  In  the  parameter.  Hence 
the  vertical  bar  In  L  does  not  signify  a  conditional  probability, 
as  It  usually  does.)  The  likelihood  function  provides  a  means  of 
quantifying  the  Information  contained  In  the  data  concerning  the 
unknown  true  value  of  the  exponential  parameter  X  . 

Suppose  a  unique  value  X  of  X  exists  maximizing  the  likelihood 

A 

function;  then  we  call  X  the  mode  of  L(X  |  x^,  ...,  x^)  and  the 
maximum  likelihood  estimator  (MLE)  of  X  .  In  general,  the  MLE,  when 
It  exists.  Is  a  very  useful  concept. 

To  simplify  the  calculation  of  X  ,  we  use  the  fact  that  the 
maximum  of  the  likelihood,  when  It  exists.  Is  achieved  at  the  same  value 
of  X  as  Is  the  maximum  of  the  logarithm  of  the  likelihood.  Thus  we 
compute 


^  In  L(X  1  x^.  ....  x^)  x^  , 


and  set  the  derivative  equal  to  0  .  We  readily  obtain 


"(  Jx  '  • 


and  verify  that  X  maximizes  L(X  [  Xj^,  ...»  x^)  for  fixed  Xj^,  ...,  x^ 
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The  MLE  X  may  be  a  very  satisfactory  estimator  of  the  unknown 
failure  rate  X  for  moderate  to  large  sample  sizes  n  .  (Caution: 

For  more  complex  life  distribution  models  Involving  an  Infinite  number 
of  unknown  parameters,  the  MLE  of  the  parameters  may  be  quite  mislead¬ 
ing.  See  Marshall  and  Proschan  (In  process)  for  an  example  in  which 
the  MLE  converges  to  the  wrong  set  of  the  parameter  values  even  though 
the  sample  size  tends  to  infinity.  Also  see  Lindley  (1972),  p.  12,  for  a 
similar  two  parameter  example  and  Basu  (1975),  p.  34,  for  a  one  para¬ 
meter  example.)  In  the  present  case  of  estimation  of  the  single  para¬ 
meter  X  of  the  exponential,  our  uncertainty  as  to  the  true  value  of 
X  stems  from  the  fact  that  our  sample  size  n  Is  finite. 

A  good  way  co  express  our  uncertainty  concerning  X  Is  by  means 
of  a  probability  distribution  for  X  .  To  display  explicitly  this  point 
of  view,  we  let  X  denote  a  random  variable  expressing  our  uncertainty 
concerning  the  unknown  true  value  of  X  . 

Bayes '  Theorem 

A  key  theorem  based  on  this  point  of  view  Is  the  fundamental  Bayes' 
Theorem.  It  provides  a  method  for  computing  the  probability  density  of 
the  random  variable  expressing  our  uncertainty  concerning  the  parameter 
conditioned  on  the  observed  data. 

1.2  Theorem  (Bayes'  Theorem) 

Let  (a)  X  and  6  be  random  variables  with  joint  probability 
density  p(x,0)  ,  (b)  p(x  |  0)  and  p(0  |  x)  denote  the  correspond¬ 
ing  conditional  densities,  and  (c)  ir(0)  denotes  the  marginal  density 

of  0  .  Let  be  the  parameter  space,  i.e.,  0  Then 
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(1.3)  p(0  I X) « LgiiCi?— . 

/p(x  i  e)iT(e)d0 

© 

Proof; 

The  joint  density  p(x,0)  of  X  and  0  may  be  written  as: 
p(x,0)  -  p(x  I  0)it(0)  . 

By  definition  of  a  conditional  probability  density, 

p(0  I  x)  -  p(x,0)/p(x)  , 

when  p(x)  >  0  ,  where 

p(x)  f  p(x  1  0)ir(0)d0  . 

© 

By  combining  the  three  equalities  in  the  steps  just  above,  we  immed¬ 
iately  obtain  the  desired  conclusion  (1.3).|| 

Prior  and  Posterior  Distributions 

Before  analyzing  statistical  data.  It  is  helpful  and  efficient 
to  assess  relevant  Information.  A  convenient  way  to  accomplish  this 
Is  to  formulate  a  probability  density  on  the  parameter(s)  of  the 
model  selected.  We  realize  that  generally  we  cannot  be  certain  that 
the  model  selected  is  the  "true"  model  and  that  the  prior  distribution 
for  the  parameter(s)  specified  for  the  model  Is  perfectly  correct. 
However,  once  we  select  an  appropriate  model,  and  a  prior  distribution 
on  the  parameter  space  for  that  model,  we  may  complete  a  useful  and 
Informative  data  analysis  In  an  unambiguous  fashion  using  only  the 
standard  calculus  of  probability  theory. 
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The  Prior  Density 

First,  we  confine  our  choice  of  prior  densities  to  proper 
densities;  l.e. ,  ir(X)  satisfies 


ir(X)dX 


1  . 


Next,  to  motivate  the  concept  of  a  natural  conjugate  prior  for 
X  ,  we  suppose  that  In  the  particular  problem  under  discussion,  we 
have  very  little  prior  Information  concerning  X  .  It  seems  natural 
to  assume  Initially  a  rectangular  prior  density: 


ir  (X) 
o 


for  0  £  X  i  M 
0  otherwise  , 
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where  M  is  a  very  large  number  (say  M  ■  10  )  .  Under  this  assump¬ 

tion  on  the  prior  density,  we  assign  the  same  probability  that  X  Is 
In  any  Interval  In  [0,M]  of  a  specified  length.  For  example,  the 
a  priori  probability  that  5  ^  X  £  10  Is  the  same  as  the  a  priori 
probability  that  19  ^  X  ^  24  . 


The  Posterior  Density 

Suppose  we  have  observed  a  sample  of  n  Ilf elengths  ,  . • • ,  x^ 
having  joint  density  p(x^,  ...,  x^  |  X)  .  By  Bayes'  Theorem,  the 
posterior  density  of  X  Is  given  by: 


1T^(X  I  x^,  x^)  -  p(Xj^, 


X  I  X)Tr  (X)dX 
n  o 
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Recall  that  the  likelihood  L(A  |  t  . • . t  x  )  -  p(x. ,  x  |  A)  , 

n  1  Q 

namely  the  probability  density  of  the  observed  outcome  considered  as  a 
function  of  the  parameter  A  .  Thus  we  may  write: 


"l’ 


*n> 


L(A  I 


*1’ 


X  )ir_(A) 
n 


fi 


L(A 


‘1* 


,  X  )ir  (A)dA 
n  o 


Since  A  has  been  Integrated  out  in  the  denominator,  the  denominator  Is 
now  a  constant  with  respect  to  A  .  Hence 


tTj^CA  1  Xj^,  ....  x^)  «  L(A  I  Xj^, 


X  )1T  (A)  , 
n  o 


where  «  means  "proportional  to."  Thus  the  right  hand  side  Is  the  same 
as  the  left  hand  side  up  to  a  constant  which  does  not  depend  on  the  para¬ 
meter  A  .  Notice  that  since  the  data  x, ,  . . . ,  x  have  already  been 

In 

observed,  the  data  are  not  considered  variables  at  this  stage  of  the 
analysis. 

Assuming  the  rectangular  prior  it  (A)  -  M  ^  for  0  <  A  <  M  ,  we 

o  ~  — 

obtain  for  the  posterior  density  of  A  : 


/  n  \  /  /  n  \ 

tTj^(A  I  Xj^,  ...,  x^)  *  M  ^a”  exp  |-A  \  y  ^  ^  ' 

-1  "  -1 

For  n  ^  X.  »  M  ,  this  is  approximately: 

1  ^ 

(1.4)  ir^(A  I  Xj^,  ...,  x^)  ^  x^j  a“  exp  |-A  I  x^j|yr(n  +  1)  , 
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go 

where  r(n  +  1)  *  J*  u'^e  *^du  is  the  gamma  function.  In  computing  (1.4) 

o  CB 

we  have  used  the  fact  that  j*  c'^^u^e'^^^du  -  r(n  +  1)  for  all  c  >  0  . 

o 

Thus,  if  we  assume  initially  a  rectangular  prior  on  [0,M]  ,  M  large, 
the  resulting  posterior  density  is  approximately  of  the  form: 


(1.5) 


bVV*’^/r(a)  for  X  >  0  , 


where  a  ,  b  >  0  .  Note  that  the  posterior  density  is  approximately  a 
gamma  density  with  shape  parameter  a  <■  n  +  1  and  scale  parameter 
n 

b  -  I  x  . 
i-1  ^ 

Now,  suppose  we  obtain  an  additional  independent  random  sample  of 

lifelengths  y,  >y.,«  ...,  y  .  Then  it  is  reasonable  to  use  as  our  nei) 
±  ^  n 

prior  density,  the  posterior  density  (1.4)  obtained  from  the  previous 
sample.  Using  as  our  new  prior: 


itj^(X)  -  b^®"^e‘*’^/r(a) 


with  a  ■  n  +  1  and  b  ■  ^  x,  ,  we  obtain: 

1  ^ 


I  ^1’  •••’  V  "  ^ 


x“  exp  |-X  I  y^jb®X®"^e"*’^/r(a) 
J*  X”  exp  |-X  ^  yj^j[b®X®"^e"*’Vr(a)]dX 


IT  (X  1  y  ,  . . . ,  y  ) 


(b  +  I  exp  (-x(b  +  I  y^)^/r(„  +  a)  . 


(1.6) 


Thus,  a  is  Increased  by  the  additional  number  m  of  observed  failures 

m 

and  b  Increased  by  the  additional  total  time  on  test  ^  y.  to  obtain 

1  ^ 

the  new  posterior  density  »  ''°te  however,  that  the  form  of  the  pos¬ 
terior  density,  the  gamma,  is  retained. 

Because  of  this  preservation  property  (the  gamma  prior  used  In  the 
exponential  model  leads  to  a  gamma  posterior),  the  gamma  prior  Is  called 
the  "natural  conjugate"  prior  for  the  parameter  X  In  the  exponential 
model.  More  generally,  a  conjugate  prior  distribution  is  "conjugate" 
with  respect  to  a  given  statistical  model  If  the  form  of  the  posterior 
Is  the  same  as  that  of  the  prior;  the  parameters  of  the  posterior  dis¬ 
tribution  will  of  course,  change  In  accordance  with  the  data  observed. 

In  the  present  case,  we  can  interpret  the  prior  density  parameter  a  -  1 
(If  a  Is  an  Integer  >  1)  as  the  number  of  observations  in  a  previous 
experiment  (actual  or  conceptual)  and  b  as  the  corresponding  total 
time  on  test. 

In  the  present  exponential  model,  the  gamma  prior  for  X  Is  mathe¬ 
matically  convenient  and  has  Intuitive  interpretation.  However,  the 
analyst  Is  not  confined  to  a  choice  with  this  family.  Rather,  the 
choice  of  the  prior  distribution  should  always  reflect  the  best  possible 
specification  of  the  analyst's  prior  Information  concerning  the  unknown 
parameter.  Thus,  In  reporting  the  results  of  the  data  analysis,  the 
analyst  should  present  his  specification  of  the  prior  and  the  basis  for 
his  choice.  In  Chapter  3,  we  will  discuss  In  more  detail  the  assignment 
of  the  prior  distribution. 

1.3  Example 

Table  1.1  lists  the  observed  lifetimes  of  100  orgalnlc  fiber  strands 
subjected  to  a  static  load  of  3009  grams  which  corresponds  to  80%  of 
their  mean  rupture  strength.  Experience  has  shown  that  the  lifetime 
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of  an  organic  fiber  strand  at  relatively  high  stress  can  be  reasonably 
well  fitted  by  an  exponential  life  distribution.  Thus,  we  assume 

P[Lifelength  >  x]  ■  e  for  x  ^  0  , 

i 

where  X  is  unknown.  As  described  above,  we  calculated  the  MLE  of  X 
as: 

X  ■  4.78  X  10  ^/hour  . 

Since  the  sample  size  of  100  is  moderately  large,  the  likelihood 


will  override  in  Importance  the  retangular  prior: 

ir^(X)  -  m"^  ,  0  <  X  <  M  , 

when  M  >>  X  .  From  the  prior  ir^  we  calculate  the  posterior  density 
of  X  to  be  approximately  a  gamma  with  parameters  a  >  101  and  b  «  20,917 
hours.  (Note  that  a  is  dimensionless  while  b  is  measured  in  hours). 

See  Figure  1.1.  The  mode  of  the  posterior  density  may  be  used  to  esti¬ 
mate  X  .  In  Chapter  2  we  discuss  other  estimators. 

Sufficient  Statistics 

By  a  statiatiOf  we  mean  a  function  (possibly  vector-valued)  of  the 
data.  A  statistic,  of  course,  is  often  used  to  estimate  an  unknown 
parameter  of  Interest.  Clearly,  the  complete  set  of  data  observed  is 
trivially  a  statistic.  For  a  large  set  of  data,  working  with  all  of  the 
observations  may  be  tedious  or  even  unmanageable.  Thus,  we  are  motivated 
to  find  a  statistic  of  smaller  dimensionality  like  the  sample  mean  (of 


LAnaoA 
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dimension  one)  or  the  number  of  failures  acnd  the  total  time  on  test  (of 
dimension  two) ,  but  which  contains  all  of  the  information  in  the  sample 
concerning  the  parameter.  Preferably,  we  would  like  to  estimate  the  para¬ 
meter  using  a  statistic  of  lower  dimensionality  which  summarizes  all  of  the 
information  in  the  sample.  We  will  often  use  D  to  denote  the  observed 
data.  For  example,  D  could  stand  for  the  vector  of  observed  values 
(x, ,x.,  ...,  X  )  .  Later  it  will  denote  more  complicated  data  sets. 

This  motivates: 

1.4  Definition 

Let  D  denote  the  data  with  probability  density  p(D  1  0)  indexed 
by  the  parameter  0  .  A  statistic  s  is  sufficient  for  0  if  and  only 
if  for  every  prior  ff(0)  ,  the  posterior 

ir(e  I  D)  -  p(D  I  0)ir(0)/  j \  0):r(0)d0 

H 

depends  on  the  data  D  only  through  the  statistic  s  ;  l.e.,  for  every 
prior  IT  ,  the  posterior  can  be  written  as  ir(0  |  s)  . 

Intuitively,  knowing  s  we  are  as  informed  about  the  parameter  0 
as  when  we  know  all  the  data  collected. 

There  may  be  several  sufficient  statistics  available  for  estimating 
a  parameter.  Clearly,  from  among  these  we  would  prefer  to  make  use  of 
one  of  lowest  dimensionality.  For  a  given  parameter,  there  actually  may 
be  more  than  one  sufficient  statistic  of  lowest  dimensionality  and  it  may 
be  vector  valued. 

An  easy  way  to  find  a  sufflclenct  statistic  is  to  examine  the  likeli¬ 
hood  for  the  kind  of  factorization  displayed  in: 


Let  the  likelihood  L(0  |  D)  factor  so  that: 


L(0  I  D)  -  g(s  I  0)h(D)  , 

where  h  does  not  depend  on  0  .  Then  s  is  a  sufficient  statistic 
for  0  . 


Proof  I 

For  an  arbitrary  prior  density  ir  ,  the  corresponding  posterior 
density  is  given  by: 


ir(0  1  D)  -  L(0  I 


D)ir(0)/  rL(0  I  D)ir 

®/f 

0)h(D)Tr(0)/  /  g(s  I 


(0)d0 


■  g(s  I  0)h(D)Tr(0)/ J  g(s  I  0)h(D)iT(0)d0 


g(a  1  S) 


g(s  1  0):i(0)d0  . 


The  last  expression  clearly  depends  on  D  only  through  the  statistic 
s  .  Thus  by  Definition  1.4,  s  is  sufficient  for  0  .  | | 

For  example,  if  x^^,  x^  are  Independent  lifelengths  in  the 


exponential  model,  given  X  ,  then  s  *^2^  ^  x,  is  a  sufficient  statistic 

1  ^ 

for  the  failure  rate  X  since: 
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The  Sample  Space 

The  acanpZe  space  Is  the  space  or  set  of  possible  sample  outcomes. 
If  we  observe  the  lifetimes  of  n  units,  the  sample  space  Is 

(1.7)  S  •  {  ....  x^)  1  x^  ^  0  ,  1  £  1  £  n}  . 

For  n  >  2  ,  S  Is  the  positive  quadrant;  l.e.. 


However,  we  may  Just  as  well  consider  another  sample  space.  Suppose 
we  are  told  only  the  ordered  lifetimes 

*(1)  -*(2)  -  •••  -*(n)  ’ 

l.e.,  we  no  longer  know  which  unit  falls  at  time  x^J  ,  1  £  1  £  n  .  The 
sample  space  corresponding  to  the  ordered  lifetimes  is  now 

(1.8)  -  Ux^j,x^2)*  •••«  *(n)^  1  °  -*(1)  -  •••  -  *(n)^  ’ 


/ 
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For  n  ■  2  ,  we  now  have 


For  sample  space  (1.7)  and  the  exponential  model  we  have  Che  joint 
probability  density 


(1.9) 


p(Xj^,X2,  ....  x^  I  X) 


ir* 


,  l^i^n 
otherwise  . 


For  this  case  L(1  |  x^.x^.  ....  x^)  -  p(Xj^,X2,  ....  x^  |  A)  . 

On  Che  other  hand,  for  sample  space  (1.7)  and  the  exponential  model 

we  have 


(1.10) 


Po(X(ij,  ....  x^^j  I  A) 


-A  ^ 


(i) 


0  1*(1)  1  •••  lX(n) 


0 


otherwise  . 


The  factor  nl  in  (l.lO)  follows  from  the  fact  that  the  ordered  observa¬ 
tions  can  result  from  any  one  of  n!  permutations  of  the  observations 
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X. X  .  For  this  case  let 
14  n 

^o^^  I  *(!)»  •••*  *(n)^  “  ^o^*(l) . *(n)  ^  * 

From  (1.9)  and  (1.10)  we  see  that 

(1.11)  L(X  I  x^ •  X2 9  ...j  ^  ^(1)*^(2)*  ^(n)^  * 

It  follows  that,  for  any  prior  for  A  ,  the  posterior  density  for 
be  the  same  no  matter  which  of  the  above  sample  spaces  we  choose. 
(1.11)  and  Lemma  1.5  we  see  that  the  order  statistics  x^^,x^2)> 
are  sufficient  for  X  . 


A  will 
From 

....  , 
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2.  SELECTED  LIFE  TEST  SAMPLING  PLANS 

In  this  section  we  Illustrate  the  application  of  the  concepts 
and  methods  of  Section  1  when  estimating  under  each  of  several  commonly 
used  life  test  sampling  plans. 

Sampling  Plan  (a) .  Complete  Observation  Until  A  Specified  Number  of 
Failures  Have  Occurred 

A  popular  plan  consists  of  putting  n  items  on  test  and  observing 
the  failure  times  of  the  first  k  failures,  where  k  is  specified  in 
advance.  The  motivation  for  following  this  plan  is  to  save  time  in 
determining  an  estimate  of  the  exponential  failure  rate. 

Let 


*(1)  -  *(2)  -  •**  -  *(k) 


,  1  <.  k  £  n 


‘(1) 


,  ....  are  called  the  first  k  order  statistics  in  a  sample 


denote  the  successive  times  of  failure  of  the  earliest  k  failures; 

X 

of  size  n  .  The  likelihood  of  this  observed  outcome  under  the  expo> 
nential  model  is  given  by; 


(n-k)Xx^j^j 

(Ue  follow  the  usual  convention  that  0!  =1  •) 

Simplifying,  we  have: 


L(X  I  D) 


n! 


1! 


l!(n  -  k) 


k  -Xx 
7  n  Xe 
'  .1*1 


(i) 


(2.1)  L(X  I  D)  -  -  3^-^ ,  x'"  exp  | -X 


k 

I  X 

i-1 


(i) 


To  verify  the  expression  preceding  (2.1),  note  that  the  combinatorial 

_ n! _ 

1!  ...  l!(n  -  k)! 


coefficient 


represents  the  number  of  ways  of  choosing 
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one  observation  to  correspond  to  each  of  . ^nd 

(n  -  k)  observations  for  the  (n  -  k)  unobserved  failure  times,  from 
among  the  n  failure  times  (of  which  only  the  first  k  are  actually 
observed).  The  product  factor  represents  the  joint  density  of  the  k 
actually  observed  failure  times,  given  A  .  Finally,  the  last  factor 
represents  the  probability  that  (n  -  k)  lifelengths  each  exceed  x^j^j  , 
given  X  . 

The  expression  In  the  exponent  of  (2.1): 


I  x^^j  +  (n  -  k)X(j^j  -  nx^^j  +  (n  -  l)(x^2)  '  ’'(!)> 


(2.2) 


+  ...  +  (n  -  k  +  l)(X(j^j  -  x^j^_jj)  T(x^^j) 

represents  the  total  time  on  teat  until  the  k^^  failure.  Note  that  it 
is  comprised  of  nx^^  the  total  time  on  teat  observed  until  the  first 
failure;  of  (n  -  l)(x^2)  “  *(1)^  *  time  on  test  observed  between 

the  first  and  second  failures;  ...  ;  and  of  (n  -  k  +  l)(x^j^j  -  , 

the  total  time  on  test  observed  between  the  penultimate  and  the  last 
observed  failures.  (Of  course.  It  Is  understood  that  after  an  item  falls. 

It  Is  no  longer  under  observation.) 

The  total  time  on  test  statistic  turns  out  to  be  a  very  Important  and 
useful  statistic  not  only  In  the  exponential  model,  but  also  after  appro¬ 
priate  generalization,  in  a  large  number  of  other  models  involving  incomplete 
data.  In  the  exponential  model,  the  total  time  on  test  and  the  number  of 
observed  failures  constitute  a  sufficient  statistic  for  X  .  In  the  pre¬ 
sently  considered  sampling  plan  under  which  k  ,  the  number  of  observed 
failures  is  specified  in  advance,  we  see  from  (2.1)  chat  T(Xj^)  alone  is 


sufficient  for  X  . 

Suppose  we  assume  a  gamma  prior  on  X  : 
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ir(A)  ■  ^^/r(a)  . 

Using  (2.1),  we  obtain  for  the  posterior; 

.  .  .  .  ,  -A[b+T(x..  .)] 

(2.3)  r(A  1  D)  -  [b  +  /r(k  +  a)  , 


also  a  gamma,  but  with  the  shape  parameter  a  of  the  prior  density  replaced 
by  a  +  k  and  the  scale  parameter  b  of  the  prior  density  replaced  by 


b  +  T(x^j^j)  .  Note  that  the  increment  in  the  scale  parameter  is  * 

the  observed  total  time  on  test.  The  mode  of  the  posterior  density  is: 


A°  -  (k  +  a  -  l)/tb  +  T(x^j^^)]  . 


It  is  interesting  to  note  that  for  ir(A)  B  c  ,  the  mode  of  the  posterior  is 
exactly  the  well  known  MLE: 

A  -  k/T(x^j^p  . 

However,  it  should  be  emphasized  that  this  prior  is  improper  in  the  sense 

that  f  Tr(A)dA  ■  «  . 

0 

We  would  expect  that  after  collecting  a  set  of  data  from  the  expo¬ 
nential  distribution,  we  would  have  more  Information  concerning  the  unknown 
parameter  A  than  before;  more  precisely,  the  peakedness  of  the  density 
of  A  might  Increase  or  the  coefficient  of  variation  might  decrease.  The 
aoeffCaient  of  vcarUiticn  of  a  distribution  is  the  ratio  of  the  standard 
deviation  (assumed  finite)  to  the  absolute  value  of  the  mean  (assumed  non¬ 
zero).  In  Che  case  of  the  gamma  prior  density,  the  mean  is  a/b  ,  the 

2  -if 

variance  is  a/b  ,  and  so  the  corresponding  coefficient  of  variation  is  a  ^ 
Under  the  present  sampling  plan,  Che  posterior  distribution,  given  in 
(2.3),  is  also  a  gamma  distribution,  but  with  the  shape  parameter 
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(the  updated  "a"  value)  now  a  -f  k  .  It  follows  that  the  coefficient 

-1/2 

of  variation  is  now  reduced  to  (a  4-  k)  .  Thus  for  fixed  a  ,  the 

-1/2 

coefficient  of  variation  decreases  roughly  as  k  ,  where  k  denotes 
the  observed  number  of  failures.  This  simple  calculation  gives  us  a 
quantitative  notion  as  to  our  relative  uncertainty  concerning  X  both 
before  and  after  observation,  and  therefore  concerning  the  decrease  in 
our  uncertainty. 

Sampling  Plan  (b).  Observation  Terminated  at  Fixed  Time  (Truncated 
Sampling) 

Suppose  n  units  are  put  on  life  test  at  time  t  •  0  and  each  is 

observed  until  failure  or  fixed  time  t  ,  whichever  occurs  first.  Given 

o 

the  random  outcome  K  »  k  (0  <  k  <  n)  observed  failures  in  [0,t  ]  , 

—  —  o 

Che  corresponding  likelihood  is  given  by: 


(2.4)  L(X  1  D) 


1! 


l!(n  -  k)! 


k 

n  Xe 
i«l 


-Xx 


(i) 


-X (n-k) t 


where  Che  product  in  square  brackets  is  defined  as  1  for  k  >  0  .  The 
verification  of  the  likelihood  expression  (2.4)  is  similar  to  that  ob¬ 
tained  under  Che  previous  sampling  plan,  leading  to  the  expression  pre¬ 
ceding  (2.1).  One  key  difference  is  that  the  number  n-k  surviving 
fixed  time  C^  under  the  present  plan  is  random,  while  the  number  n-k 
surviving  past  Che  k^^  failure  under  the  earlier  plan  is  specified  in 
advance. 

We  may  rewrite  Che  likelihood  L(X  |  D)  as: 

X(i)  + 


(n  -  k) ! 


■X  exp  {-X 


k 

I 

i-1 


(2.5)  L(X  I  D) 
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It  is  clear  from  (2.5)  and  Lemma  1.5  that  the  pair  k  and  the  total 
time  on  test: 


’  Jl  ‘ 

are  sufficient  statistics  for  \  .  Under  the  present  sampling  plan, 
both  k  and  the  ^(t^)  observed,  and  thus  together  constitute  the 

data  D  . 

From  (2.5)  the  MLE  Is  computed  as: 

X  -  k/T(t  )  . 
o 

Under  the  present  sampling  plan  It  Is  possible  to  observe  k  >  0  failures, 
leading  to  a  MLE  of  0  .  Such  an  estimate  Is  Intuitively  unsatisfactory; 

In  this  situation  an  analysis  based  on  the  posterior  distribution  of  X 
Is  preferable. 

The  posterior  density  resulting  from  a  gamma  prior  [see  (1.4)]  Is, 
using  (2.5): 

I  k+a  k+a-1 

(2.6)  :r(X  I  D)  -  [b  +  T(t^)]  X*^  ®  e  °  /r(k  +  a)  . 

Just  as  In  the  previous  sampling  plan,  the  posterior  coefficient  of 

-1/2  -1/2 
variation  Is  (a  +  k)  ,  which  Is  smaller  than  a  ,  the  prior 

coefficient  of  variation. 

Sampling  Plan  (c).  Inverse  Binomial  Sampling 

A  unit  Is  put  on  test  until  It  fails  or  reaches  a  specified  age 

t  ,  whichever  occurs  first.  At  this  time,  the  unit  Is  replaced  by  a 
o 

new  unit.  This  procedure  Is  repeated  sequentially  until  k  (specified 
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in  advance)  failures  are  actually  observed.  The  number  N  of  units 
that  have  to  be  tested  until  k  failures  are  actually  observed  is,  of 
course,  a  random  variable.  For  example,  we  may  have  only  one  test 
chamber  and  we  may  be  able  to  test  at  most  one  unit  at  a  time. 

Let  Y  «  min  (X,t^)  ,  where  X  has  exponential  density  Xe"^*  , 

X  ^  0  .  Then  conditional  on  Y  <  t^  and  T  ,  the  density  of  Y  is 
given  by; 


Xe“^y/(l  -  e 


for  0  <  y  <  t 
~  —  o 


Thus,  if  the  successive  failure  ages  actually  observed  are  denoted  by 

^1*^2’  ■*’*  corresponding  conditional  joint  density  is  given 

by; 


h  i  J 


It  X  exp 

The  probability  that  N  ■  n  units  have  to  be  tested  in  order  to  observe 
k  actual  failures  is  given  by 


PIN  -  n  I  AI  -  (2  ;  J)(l  -  .■''<>) 


-Xt^\k  -Xt^(n-k) 

e  for  n  >  k  . 


It  follows  that,  given  N  ■  n  and  observed  failure  ages  Y]_’^2*  *’’’  t 
the  corresponding  likelihood  is: 


L(A  1  D)  "  (k  I  exp  |-X  I  exp  t-Xt^(n-k)]  ,  0  <  y^  < 


t^  ,  i  -  1,  ....  k 


Combining  exponentials  we  obtain; 


24 


(2.7) 


\  k  1 

( 

k 

jx  exp 

1‘ 

1  y^  +  (n  -  k)t^ 

0  <  j  i  •  Xf  •••t  It 


From  Lemma  1.5,  we  conclude  that  the  total  time  on  teat: 


T  -  I  y,  +  (n  -  k)t 
i-1  ^ 


ia  by  itself  sufficient  for  \  since  k  is  fixed  in  advance. 

From  (2.7)  we  also  obtain  the  posterior  density  for  X  based  on  a 
gamma  prior  density  as: 

(2.8)  ir(X  1  D)  -  (b  +  T)’‘'^*x‘‘'^®~^e"^^^’'Vr(k  +  a)  . 

Note  the  striking  resemblance  in  the  posterior  density  under  the  three 
samplings i)lans  so  far  considered!  (Compare  (2.8),  (2.6^,  and  (2.3).) 

Remarks ; 

In  the  three  sampling  plans  considered  so  far,  the  number  k  of 
observed  failures  and  the  total  time  on  test  T  are  all  that  we  need 
from  the  observations  in  order  to  complete  our  data  analysis;  the  suffi¬ 
ciency  of  k  and  T  makes  this  true.  Note  that  this  fact  holds  for 
any  choice  of  a  prior  density. 

The  "stopping  rule"  in  each  of  the  three  test  plans  considered 
gives  no  information  about  the  parameter  X  ,  l.e.,  is  noninformative 
about  X  .  (As  the  name  implies,  the  stopping  rule  is  simply  the  rule 
which  specifies  when  the  testing  is  to  stop.  For  example,  in  the 
sampling  plan  (a)  we  test  until  k  failures  are  observed  and  then  stop 
further  observation.)  If  the  stopping  rule  were  to  give  information 


about  the  parameter  X  ,  then  the  total  time  on  test  T  and  the  observed 
number  of  failures  would  not  be  sufficient  for  X  . 

Finally,  note  that  in  the  test  plans  considered  thus  far,  the  MLE 
is  the  ratio  of  the  number  of  observed  failures  to  the  total  time  on 
test.  This  simple  formula  for  the  MLE  holds  in  most  of  the  testing  pro¬ 
cedures  followed  under  the  exponential  model,  as  we  see  in  this  chapter 
and  shall  see  in  Chapter  2. 
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3.  INFERENCE  BASED  ON  MEAN  LIFE 

Thus  far  we  have  discussed  Inference  for  the  exponential  distri¬ 
bution  based  on  the  failure  rate  X  .  For  many  analysts,  the  mean  life 
9  of  the  exponential  distribution  may  seem  to  be  the  more  appropriate 
parameter  to  estimate.  Note  that  either  parameter  determines  the  other 
in  the  exponential  model  since: 


0  - 

Suppose  we  consider  the  simplest  type  of  testing  plan:  n  units 
are  put  on  test  and  observed  until  each  fails.  The  corresponding  mutually 
Independent  llfelengths  given  X  are  Xj^.x^*  ....  x^  ,  constituting  a  complete 
sample  from  the  exponential  density: 

f(x  I  0)  -  0"^e‘*^®  . 

The  likelihood  of  the  outcome  is  given  by: 


(3.1)  L(9  1  x^. 

Clearly,  the  MLE  of  0 


is  given  by: 


9 


Suppose  now  we  have  very  little  prior  information  on  the  parameter 
0  .  Ue  therefore  assume  a  veotangulav  prior  density  on  0  : 

x(0)  -  for  0  £  9  £  M  , 


.7^ 
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where  M  is  large. ^  The  corresponding  posterior  density  for  0  may 
be  computed  approximately,  by  Bayes*  Theorem  (Theorem  1.2),  as: 


(3.2) 


^  I  *2^ »  •  •  • » 


j,ag-(a+l)e-b/e/r(a) 


for  6  ,  a  ,  b  >  0  ,  where  now  a  ■  n  -  1  and  b  ■  x.  . 

1 

The  density  of  (3.2),  denoted  by  ^^(6)  ,  is  called  the  inverted  gaama 

density t  since  if  6  is  a  random  variable  with  density  r  .  (9)  ,  then 

a,o 

6^*1  has  gamma  density  (1.5). 

We  may  verify  readily  that  if  our  inltal  prior  is  of  the  form 
IT  .  (6)  ,  of  (3.2)  and  we  use  any  one  of  the  sampling  plans,  then  the 

o  f  O 

corresponding  posterior  density  is  also  of  the  form  (3.2).  However, 
the  parameter  a  of  the  prior  is  replaced  by  a  ■**  k  in  the  posterior, 
and  the  parameter  b  of  the  prior  is  replaced  by  b  T  in  the  post¬ 

erior.  As  before,  k  denotes  the  number  of  observed  failures  and  T 

1$  the  total  time  on  test.  This  follows  readily  from  the  likelihood 

n 

expression  (3.1)  and  the  fact  that  in  this  case  T  -  ^  x.  since  all 

1  ^ 

lifelengths  are  observed. 


The  Mean  of  the  Posterior  Density 

The  mean  of  the  inverted  gamma  density  given  in  (3.2)  is  readily 


It  would  be  Inconsistent  mathematically  to  assume  a  rectangular  prior 

.-1 

on  both  \  and  9  •  X  ,  even  though  we  have  very ^little  prior  infor¬ 
mation  on  both  parameters.  (The  implications  for  9  that  result  from 
assuming  a  rectangular  prior  for  are  brought  out  in  Exercise  1.) 

We  assume  a  rectangular  prior  for  9  here  to  motivate  the  use  of  a  natural 
conjugate  prior. 


w 
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calculated : 


f  e{b®0'^®‘^^^e~'’^®/r(a)}de  -  b/(a  -  1)  . 

For  Che  Inverted  gamma  posterior  density  in  which  the  parameters  are 
a  +  k  (In  place  of  a  )  and  b  4-  T  (In  place  of  b  ) ,  the  corresponding 
mean  takes  the  form 


(3.3) 


b  +  T 
a  +  k  -  1 


(1  -  w) 


> 


tdiere  w»k/(k  +  a-l)  .  Thus  the  mean  of  the  posterior  density  may 
be  written  as  a  convex  combination  of  the  prior  mean  b/(a  -  1)  and 
the  maximum  likelihood  estimate  T/k  of  the  exponential  life  distri¬ 
bution  mean.  Note  chat  as  k  ,  the  number  of  observed  failures  Increases, 
the  posterior  mean  attaches  more  weight  to  the  MLE  of  the  true  mean  and 
less  weight  to  the  prior  mean. 

Table  3.1  summarizes  the  properties  of  Che  natural  conjugate  prior 
density  and  of  the  corresponding  posterior  density  for  the  two  different 
parametrlzaclons  of  Che  exponential  model. 
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TABLE  3.1 

COMPARISON  OF  ALTERNATIVE 
PARAMETERIZATIONS  OF  THE  EXPONENTIAL  MODEL 


Parameter 


Likelihood 


Natural  Con¬ 
jugate  Prior 


Prior  Mean 


Prior  Variance 


Prior  Coefficient 
of  Variation 


Posterior  Mean 


Posterior  Variance 


Posterior  Coeffi¬ 
cient  of  Variation 


Failure  Rate ,  X 


,k  -XT 
X  e 


^"1  *hX  /../  x 

b  X  e  /r(a)  , 
a,b  >  0  (Gainma) 


a/b 


a/b" 


-1/2 


(a  +  k)/(b  +  T) 


a  k 
(b  +  T)' 


(a  +  k) 


-1/2 


Mean  Life ,  9 


,-k,-T/e 


a,b  >  0  (Inverted  Ganuna) 


b/(a  -  1) 
a  >  1 


b^/[(a  -  l)^(a  -  2)1 
a  >  2 


(a  -  2) 
a  >  2 


-1/2 


(b  +  T)/(a  +  k  -  1) 
a  >  1 


(b  +  T)‘ 


(a  +  k  -  l)‘(a  +  k  -  2) 
_ a  >  2 _ 


(a  +  k  -  2) 
a  >  2 


-1/2 


Note:  k  «  number  of  observed  failures, 
T  >  total  time  on  test. 
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Arbitrary  Prior  Density 

In  our  analysis  up  to  now,  we  have  focused  mostly  on  the  case  In 
which  the  prior  density  Is  the  natupdl  conjugate  prior.  In  this  sub¬ 
section,  we  expand  our  consideration  to  cases  In  which  the  prior  Is 
not  necessarily  the  natural  conjugate  prior.  We  obtain  results  similar 
to  those  holding  In  the  natural  conjugate  prior  case. 

Let  IT (6)  be  a  prior  density  on  such  that  {0  ]  ir(6)  >  0}  is 
an  Interval  on  [0,«)  .  We  show  In  Appendix  A  of  this  chapter  that 


(3.4)  Pie  >  6  I  k,T]  -  /  ir(0  I  k,T)d0 

o 

is  decreasing^  in  k  >  0  for  fixed  T  and  increasing^  in  T  for  fixed 
k  ,  l.e.,  the  posterior  random  variable  0  Is  stochastically  decreasing 
in  k  and  stochastically  increasing  in  T  ,  In  particular, 

El0  I  k  •  0,T]  i  E[e  1  k  -  0  ,  T  -  0]  ; 

the  lower  bound  Is,  of  course,  the  mean  of  the  prior  density.  Thus, 
observing  total  time  on  test  without  observing  failures  tends  to  change 
our  belief  about  0  as  compared  with  our  prior  belief;  we  tend  to  be¬ 
lieve  In  a  larger  0  .  However,  for  the  natural  conjugate  prior,  the 
variance  of  0  given  k  and  T  ,  decreases  with  k  but  increases 
with  T  .  Also  the  coefficient  of  variation  Is  constant  in  T  but 

decreases  with  k  .  (See  Table  3.1.)  Hence  If  k  >  0  ,  large  values  * 

of  T  tend  to  make  us  optimistic  regarding  0  .  However,  failures  are 

,-l 

needed  to  sharpen  the  posterior  density.  Similarly,  for  X  ■  0  ,  we 

t 

Teminology:  Throughout  the  book  we  use  the  term  "increasing"  In  place  of 
"nondecreasing"  and  "decreasing"  in  place  of  "nonincreasing." 

} 

k 

1. 

i 

{ 

« .  wifL 


can  show  chat  for  a  general  prior  density  on  X  under  mild  regularity 
conditions, 

P(X  >  X^  t  It.Tl 

inaveaaes  in  k  for  fixed  T  and  deoreaBee  in  T  for  fixed  k  ,  just 
as  we  would  expect. 

3.1  Example 

Suppose  our  prior  density  on  e  Is  the  natural  conjugate  prior 
with  a  ■  4  and  b  »  12  .  We  put  10  units  on  test.  The  earliest  fail¬ 
ure  occurs  at  -  1  ,  followed  by  a  withdrawal  at  t  «  2  ,  and 

finally  a  second  failure  at  X^2)  ■  3  .  In  Figure  3.1,  we  plot  the 
posterior  mean,  posterior  standard  deviation,  and  posterior  coeffi¬ 
cient  of  variation  as  a  function  of  t  ,  the  test  time  elapsed.  Table  3.1 
may  be  used  to  generate  the  plots. 

In  Figure  3.2  we  have  plotted  the  posterior  density  for  9  at 
selected  times  during  Che  life  test.  The  posterior  density  for  t  -  0 
is  of  course  Che  prior  density.  Notice  the  shape  of  the  posterior  density 
at  c  >  1  (l.e. ,  just  before  Che  first  observed  failure)  and  at  t  <■  1 

(l.e. ,  just  after  the  first  failure). 


1 


X 


Coefficient  of  Variation 


0  .2  .4  .6  .8  i.O  1.2  1,4  1.6  1.8  2.0  2.2  2.4  2.6  2.8  3.0  3.2 


X 


(1) 


1 


*(2) 


time 


FIGURE  3.1 

POSTERIOR  MEAN,  POSTERIOR  STANDARD  DEVIATION,  AND  POSTERIOR  COEFFICIENT 


OF  VARIATION  AS  A  FUNCTION  OF  ELAPSED  TEST  TIME 
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4.  NOTES  AND  REFERENCES 
Section  1; 

Books  which  emphasize  Bayesian  concepts  as  well  as  their  applica¬ 
tions  are  DeGroot  (1970)  and  Winkler  (1972).  In  a  series  of  papers, 

D.  Basu  (1975),  (1977),  (1978),  points  out  the  Inadequacy  of  other 
approaches  to  statistical  Inference.  Definition  1.4  of  sufficiency  is 
attributed  by  D.  Basu  to  Kolmogorov  (1942) .  The  connection  between  sam¬ 
ple  theory  (Fisher)  sufficiency  and  Kolmogorov  sufficiency  is  discussed 
in  Shih-Chuan  Cheng  (1978).  Fisher  sufficiency  implies  Kolmogorov  suffi¬ 
ciency.  The  converse  is  false  in  general.  Natural  conjugate  priors  were 
introduced  by  Raiffa  and  Schlalfer  (1961).  For  a  rigorous  characteri¬ 
zation  of  natural  conjugate  priors,  see  Diaconis  and  Ylvisaker  (1979). 

The  reviews  of  Bayesian  statistics  by  Lindley  (1972)  and  (1978)  present 
excellent  summaries  of  recent  advances  in  the  subject. 

Section  2; 

Epstein  and  Sobel  (1953),  (1954)  were  the  first  to  investigate  the 
properties  of  the  exponential  model  applied  to  life  test  plans.  In  a 
series  of  papers  they  made  an  intensive  and  extensive  study  of  the 
statistical  features  of  a  variety  of  exponential  procedures  for  life 
testing.  Their  work  greatly  Influenced  reliability  theory  and  practice 
at  the  time  and  still  strongly  Influences  current  statistical  practice 
in  reliability.  Government  Handbook  H-108  and  its  subsequent  modifica¬ 
tions  represent  the  Government's  "seal  of  approval"  and  its  effort  to 
Implement  the  theory  by  making  readily  available  tables  and  graphs  for 
easy  use  of  the  exponential  model. 
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Section  3; 

Stochastic  monotonicity  properties  of  the  posterior  mean  are 
derived  using  the  concepts  and  methods  of  total  positivity.  (A 
comprehensive  and  authoritative  treatment  of  total  positivity  may  be 
found  in  Karlin  (1968).)  Theorem  A. 1  in  the  appendix  is  similar  to 
Lemma  2,  p.  74,  of  Lehmann  (1939). 

In  the  mathematical  Insurance  literature  formula  (3.3)  is  called 
the  credibility  formula.  Jewell  (1977)  and  (1978)  has  discussed  f 

Bayesian  life  testing. 
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APPENDIX  A 

POSTERIOR  DISTRIBUTIONS  CORRESPONDING  TO  ARBITRARY 
PRIORS  AND  LIKELIHOODS  WITH  THE  MONOTONE  RATIO  PROPERTY 


We  will  prove  Che  results  mentioned  In  Section  3. 

A.l  Theorem 

Let  a  prior  density  on  (§)  such  that  {6  |  ^  O 

Is  an  Interval  In  •  Lac  data  D  *  (k,T)  and  likelihood 

L(6  I  k,T)  be  given  such  chat  for  8^  <  02 

L<0j^  1  k.T)/L(e2  1  k.T) 

Is  Increasing  In  k  and  decreasing  In  T  .  Let  g(®)  be  Increasing 
In  6  .  Then 


(A.1) 


I 


g(0)ir(0  I  k,T)d0 


Is  decreasing  In  k  (T  fixed)  and  Increasing  In  T  (k  fixed) . 
The  proof  will  be  given  shortly. 


A. 2  Corollary 

If  L(9  1  k,T)  «  0"*^e"^^®  ,  then 


(A.  2) 


Pie  >  0 

o 


k.T] 


ir(e  1  k.Tjde 


Is  decreasing  In  k  (T  fixed)  and  Increasing  in  T  (k  fixed).  [See 
Section  3  and  (3.4).] 


Proof ; 


In  Theorem  A.l,  lec  g(9)  =  1  for  6^8^  and  0  otherwise.]  | 

A.  3  Corollary 

k  “XT 

Let  L(X  I  k,t)  «  X  e  and  g(X)  be  Increasing  in  X  .  Then 
(A.3)  j  g(X)ir(X  I  k,T)dX 

•4 

is  increasing  in  k  (T  fixed)  and  decreasing  in  T  (k  fixed). 

Clearly  (A.l)  [A.3]  implies  that  all  moments  of  the  posterior 
distribution  are  decreasing  [increasing]  in  k  and  increasing  [decreasing] 
in  T  . 

Proof  of  Theorem  A.l; 

For  ^  ®2  ’ 

L(9j^  I  k,T)/L(02  1  k,T) 


increasing  in  k  implies  that  for  k^  <  k^ 


L(9  1 

kj^,T)/L(9  1 

k2,T) 

is  increasing  in  0  . 

It  is 

easy  to  see 

that  this  in  turn  Implies  that: 

(A.  4) 

ir(0  1 

k^,T)/lT(0  1 

k^.T) 

is  also  increasing  in 

9  . 

Let 

A  -  [0  I  ir(9  I  k^,T)  >  it(0  I  kj.T)) 
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B  -  {0  I  it(0  (  kj^.T)  <  it(0  I  k2.T)}  . 

Let  a  «  Inf  g(0)  and  b  >  sup  g(0)  .  Then  the  ratio  in  (A. 4)  Is 
0eA  0eB 

Increasing  in  0  implies  that  a  ^  b  . 

How 


i 


g(e)[ir(0 


k^.T)  -  ir(0 


k2,T)]d0 


k2,T)]d0 


k2,T]d0 


■  (a  -  b) 


k^.T) 


it(0  I  k2,T)]d0  >  0  . 


Hence >  the  integral  in  (A.l)  is  decreasing  in  k  . 

A  similar  argument  proves  that  the  integral  in  (A.l)  is  Increasing 
in  T  . 1 1 


